Testing perfection is hard 



Noga Alon* Jacob Fox"!" 



Abstract 

A graph property V is strongly testable if for every fixed e > there is a one-sided e-tester for 
V whose query complexity is bounded by a function of e. In classifying the strongly testable graph 
properties, the first author and Shapira showed that any hereditary graph property (such as V the 
family of perfect graphs) is strongly testable. A property is easily testable if it is strongly testable 
with query complexity bounded by a polynomial function of e~^, and otherwise it is hard. One of 
our main results shows that testing perfectness is hard. The proof shows that testing pcrfcctness 
is at least as hard as testing trianglc-frceness, which is hard. On the other hand, we show that 
induced Pa-frceness is easily testable. This settles one of the two exceptional graphs, the other 
being C4 (and its complement) , left open in the characterization by the first author and Shapira of 
graphs H for which induced i?-freeness is easily testable. 

1 Introduction 

Property testing is an active area of computer science where one wishes to quickly distinguish between 
objects that satisfy a property from objects that are far from satisfying that property. The study 
of this notion was initiated by Rubinfield and Sudan [22], and subsequently Goldreich, Goldwasser, 
and Ron [H] started the investigation of property testers for combinatorial objects. Graph property 
testing in particular has attracted a great deal of attention. A property P is a family of (undirected) 
graphs closed under isomorphism. A graph G with n vertices is e-far from satisfying V if one must 
add or delete at least eii? edges in order to turn G into a graph satisfying V. 

An e-tester for 'P is a randomized algorithm, which given n and the ability to check whether there is 
an edge between a given pair of vertices, distinguishes with probability at least 2/3 between the cases 
G satisfies V and G is e-far from satisfying V. Such an e-tester is one-sided if, whenever G satisfies 
7-", the e-tester determines this with probability 1. A property V is strongly-testable if for every fixed 
e > there exists a one-sided e-tester for V whose query complexity is bounded only by a function of 
e, which is independent of the size of the input graph. 

We call a property V easily testable if it is strongly testable with a one-sided e-tester whose query 
complexity is polynomial in e~^, and otherwise V is hard. This is analogous to classical complexity 
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theory, where an algorithm whose running time is polynomial in the input size is considered fast, 
and otherwise slow. Call a hereditary graph property extendable if for all but finitely many graphs 
in the family, there is a larger graph in the family containing it as an induced subgraph. Most of 
the well-known hereditary graph properties are extendable. As mentioned briefly in [3] and proved in 
detail in |15j . there is a universal one-sided e-tester for extendable hereditary graph properties which 
has query complexity at most quadratic in the minimum possible query complexity of an optimal 
one-sided e-tester. Indeed, it samples d random vertices (for some d), and if the subgraph they induce 
is in V, it accepts, and otherwise it rejects. The query complexity of this tester is (2), and it is at 
least as accurate as any tester with query complexity at most d/2. The query complexity is a lower 
bound for the running time of an e-tester, and, if there is a polynomial time recognition algorithm for 
membership in V, the running time is polynomial in the query complexity. So while query complexity 
and running time are different notions, they are often of comparable order. 

For a graph H, let Vh denote the property of being H-hee, i.e., it is the family of graphs which do not 
contain H as a subgraph. The triangle removal lemma of Ruzsa and Szemeredi [23] is one of the most 
influential applications of Szemeredi's regularity lemma. It states that for every e > there is > 
such that any graph on n vertices with at most 6n^ triangles can be made triangle-free by removing 
at most ev? edges. The triangle removal lemma is equivalent to the fact that Vk^ is strongly testable. 
Indeed, the algorithm samples t = 26~^ triples of vertices uniformly at random, where 6 is picked 
according to the triangle removal lemma, and accepts if none of them form a triangle, and otherwise 
rejects. Any triangle-free graph is clearly accepted. If a graph is e-far from being triangle-free, then it 
contains at least 6n^ triangles, and the probability that none of the sampled triples forms a triangle at 
most (1 - 5)* < 1/3. Notice that the query complexity depends on the bound in the triangle removal 
lemma. As observed by Ruzsa and Szemeredi, the triangle removal lemma gives a simple proof of 
Roth's theorem [21] that every dense subset of the integers contains a 3-term arithmetic progression. 
From Behrend's construction [7], which gives a large subset of the first n positive integers without a 
3-term arithmetic progression, it follows that S < e'^^"^'^ in the triangle removal lemma. This implies 
that testing triangle-freeness is hard. Indeed, in the universal algorithm described earlier, in a random 
sample of d vertices, the expected number of triangles is at most Sd"^, and hence in the universal 
one-sided e-tester for triangle-freeness, 1/3 < 5d^, or equivalently, d > {36)~^^^. As discussed earlier, 
the query complexity of any one-sided e-tester for triangle-freeness is at least d/2. 

The triangle removal lemma was extended in [3] (see also [2]) to the graph removal lemma. It says 
that for each e > and graph H on h vertices there is (5 = 5(e, H) > such that every graph on n 
vertices with at most copies of H can be made H-bee by removing at most en^ edges. The graph 
removal lemma similarly implies that testing i?-freeness is strongly testable. The proof, which uses 
Szemeredi's regularity lemma, gives a bound on the query complexity which is a tower of height a 
power of e~^. This was somewhat improved recently by the second author [12] to a tower of height 
logarithmic in e^^. The first author [T] showed that i?-freeness is easily testable if and only if H is 
bipartite. 

For a graph H, let denote the property of being induced H-hee, i.e., it is the family of graphs 
which do not contain H as an induced subgraph. The graph removal lemma was extended by the 
first author, Fischer, Krivelevich, Szegedy [3] to the induced graph removal lemma, which states that 
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for every e > and graph H on h vertices there is > such that any graph on n vertices with at 
most 6n'^ induced copies of H can be made induced H-iree by adding or removing at most e-n? edges. 
The induced graph removal lemma is equivalent to the fact that, for any graph H, the property 
is strongly testable. The proof, which uses a strengthening of Szemeredi's regularity lemma, gives a 
bound on the query complexity which is wowzer of height a power of e~^, which is one higher in the 
Ackermann hierarchy than the tower function. This has recently been improved by Conlon and the 
second author jlUj to the tower function. 

The length of a path is the number of edges it contains, and we let denote the path of length k. 
The first author and Shapira [1] showed that for any graph H other than the paths of length at most 
3, a cycle of length 4, and their complements, testing induced iJ-freeness is hard. For H a path of 
length at most 2 or their complements, induced iif-freeness is easily testable. They left open the cases 
that H \s a. path of length 3 or a cycle of length 4 (and equivalently its complement). Here we settle 
one of the two remaining cases. 

Theorem 1.1 Induced P^-freeness is easily testable. 

A well-known result of Seinsche [23] gives a simple structure theorem for induced P^-fiee graphs. These 
graphs, also known as cographs, are generated from the single vertex graph by complementation and 
disjoint union. This is equivalent to the statement that every induced Pa-free graph or its complement 
is not connected. 

A quite general result of the first author and Shapira [5] states that every hereditary family V of 
graphs is strongly testable. They further asked which hereditary graph properties are easily testable, 
and, in particular, for a few of the well-known hereditary families of graphs, including perfect graphs 
and comparability graphs. 

Note that the chromatic number of a graph is at least its clique number as the vertices of any clique 
must receive different colors in a proper coloring. A graph is perfect if every induced subgraph of it 
satisfies that its clique number and chromatic number are equal. The study of perfect graphs was 
started by Berge, partly motivated by the study of the Shannon capacity in information theory, which 
lies between the clique number and chromatic number of a graph. Perfect graphs form a relatively 
large class of graphs for which several fundamental algorithmic problems which are known to be NP- 
hard for general graphs, such as the graph coloring problem, the maximum clique problem, and the 
maximum independent set problem, can all be solved in polynomial time (see [E]). Also, it has 
significant connections with the study of linear and integer programming (see, e.g., |20|). 

A famous conjecture of Berge, which was proved a few years ago by Chudnovsky, Robertson, Seymour 
and Thomas [9], states that a graph is perfect if and only if it contains no induced odd cycle of length 
at least five or the complement of one. The proof in fact establishes a stronger structural theorem 
for perfect graphs which was conjectured by Conforti, Cornuejols, and Vuskovic. It says that every 
perfect graph falls into one of a few basic classes, or admits one of a few kinds of special decompositions. 
Shortly afterwards, a proof that perfect graphs can be recognized in polynomial time (as a function 
of the number of vertices of the graph) was discovered by Chudnovsky, Cornuejols, Liu, Seymour, and 
Vuskovic [8]. 
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Another well-studied hereditary family of graphs are comparability graphs. A comparability graph is 
a graph that connects pairs of elements that are comparable to each other in a partial order. Gallai 
[13] classified these graphs by forbidden induced subgraphs, and Dilworth's theorem [TT] is equivalent 
to the statement that the complement of comparability graphs are perfect. Further, comparability 
graphs can be recognized in polynomial time (see McConnell and Spinrad [E]). Every cograph is 
a comparability graph, and every comparability graph is a perfect graph. It is natural to suspect 
that the structure theorem could hint at a polynomial in e^^ tester for perfectness similar to testing 
cographs. However, we show that testing perfectness essentially requires as much query complexity 
(or time) as testing triangle-freeness, which is hard. 

Theorem 1.2 Testing perfectness is hard. 

Indeed, Theorem 13.11 shows that from a graph on n vertices which is 14e-far from being triangle-free 
but a random sample of d vertices is with probability at least 1/2 triangle- free, we can construct a 
graph on 5n vertices which is e/25-far from being induced Cs-free but a random sample of d vertices in 
it is a comparability graph with probability at least 1/2. Since every comparability graph is perfect, 
every perfect graph is induced Cs-free, and testing triangle-freeness is hard, this implies the above 
theorem that testing perfectness is hard, and further that testing for comparability graphs is hard. 

Theorem 1.3 Testing for comparability graphs is hard. 

In the next section, we show that induced Ps-freeness is easily testable. In Section [3l we show that 
testing perfectness is at least as hard as testing triangle-freeness, which is hard. We finish with some 
concluding remarks. Throughout the paper, we systematically omit floor and ceiling signs whenever 
they are not crucial for the sake of clarity of presentation. We also do not make any serious attempt 
to optimize absolute constants in our statements and proofs. 

2 Induced Pa-freeness is easily testable 

A cut for a graph G = {V, E) is a partition V = ViL)V2 into nonempty subsets such that there are 
no edges between Vi and V2 or Vi is complete to V2. The following definition is a natural relaxation 
of a cut. For /3 > 0, define a f3-cut for a graph G = (y, E) as a partition V = ViiJV2 into nonempty 
subsets such that e{Vi,V2) < (3\Vi\\V2\ or e(Vi, V2) > (1 - /3)|¥L||y2|. For a graph G and vertex subset 
S, let G[S] denote the induced subgraph of G with vertex set S. Let c(/3, n) be the least 6 for which 
there is a graph G = {V, E) on n vertices which has no /3-cut and has 6n^ induced copies of P3 . 

Theorem 2.1 We have c{j3,n) > (/3/100)i2_ 

Proof: Suppose for contradiction that there is a graph G on n vertices which does not have a /3-cut 
and has less than 5n^ induced copies of P3, where 6 = (/3/100)^^. Since G has no /3-cut, then G 
contains an induced P3. Hence, 1 < and n > 5"^/'' > {100/(3f. 
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Since G has at most 5n^ induced copies of -P3, a random sample of r = (8(5)~^/^ > 10^ /3~^ vertices 
has in expectation at most 5r^ = 1/8 induced copies of P3. Hence, with probabihty at least 7/8, a 
random sample of r vertices contains no induced P3. 

Randomly sample a set R = SyjToir = s-\-t vertices from V , where s = t = r/2. Let Eq be the 
event that G[R\ is induced P3-free, so the probability of event Eq is at least 7/8. 

Since G does not have a /3-cut, each vertex has more than /3(n — 1) neighbors and less than (1 — /3)(n — 1) 
neighbors. Let a = j3/2. Hoeffding (see Section 6 of [H]) proved that the hypergeometric distribution 
is at least as concentrated as the corresponding binomial distribution. Thus, by the Azuma-Hoeffding 
inequality (see, e.g., [6]), and the fact that each vertex v ^ S has more than (3{n — 1) neighbors, the 
probability that a particular v & S has less than a{s — 1) neighbors in S" is a most 

g-((/3-a)(.-l))V{2(s-l)) ^ g-(/3-a)2(s-l)/2 < g-/3^s/16 < 

16s' 

Similarly, the probability that v has more than (1 — a){s — 1) neighbors in S is at most Let Ei 
be the event that every vertex in S has at least a{s — 1) and at most (1 — a){s — 1) neighbors in S, 
i.e., the induced subgraph G[S] has minimum degree at least a{s — 1) and maximum degree at most 
(1 — a){s — 1). By the union bound, the probability of event Ei is at least 1 — 2s • ^ = 7/8. 

Let U be the set of vertices v (zV\S which are complete or empty to S. As the degree of each vertex 
of G is at least /3{n — 1) and at most (1 — /3)(n — 1), the probability that for a given vertex v, a random 
subset of s vertices of \ {v} are all neighbors of v or all nonneighbors of v is at most 2(1 — fSy. 
Hence, a given vertex has probability at most 2(1 — (3)^ of being in U. By linearity of expectation, the 
expected size of U is at most 2(1 — /3)*n. Let E2 be the event that \U\ < 16(1 — /3)*n < 16e~^*n < ^n. 
By Markov's inequality, the probability of E2 is at least 1 — 1/8 = 7/8. 

Let -E3 be the event that T contains no vertex from U. By linearity of expectation, E[|f/ n T\] = 
E[\U\]t/n < 2(1 - pyt < 2e~^H < \. Therefore, event £^3 occurs with probability at least 7/8. 

The probability that events Eq and Ei both occur is at least 7/8 — 1/8 = 3/4. If both of these events 
occur, then G[S] has at least one and at most 2^ ^ cuts. Consider such a cut S* = 5i U 5*2 of G[S], and 
suppose 5i is complete to 52 (the case Si is empty to 5*2 can be treated similarly). For each such cut, 
consider the partition F \ 5 = ?7 U Vq U Vi U V2 of vertices, where v \ S satisfies v ^Vq \i v 
and it is not complete to and not complete to ^2, w G Vi if it is complete to 52 but not complete 
to 5i, and f G V2 if it is complete to 5i but not to 52. 

Note that if T contains a vertex from Vq, then the cut 5 = 5i U 52 of G[S] does not extend to a cut 
of G[R\. If events Ei for i = 0, 1, 2, 3 occur, which happens with probability at least 1/2, then G[R\ is 
induced P3-free, so it has a cut, and no vertex in T is complete or empty to 5. In this case one of the 
cuts of G\S] extends to a cut of G[B\, and hence, for at least one cut of G[S]^ no vertex of T is in the 
corresponding Vq. 

We now condition on the occurrence of events Ei for i = 0, 1, 2, 3. Note that since the probability that 
this happens is at least 1/2, for any other event E, the conditional probability that E occurs given 
that Ei occur for i = 0, 1, 2, 3 is at most twice the probability of E without any conditioning. 

To complete the proof we claim that with positive probability Eq, Ei, E2, E^ occur and yet the induced 
subgraph on 5UT contains an induced P3, contradicting Eq. To do so we apply the union bound over 
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all cuts in G[S] to show that with positive probability, for each such cut, either T contains a vertex of 
Vo (and hence the cut cannot be extended to one in or T contains a vertex vi in Vi and a vertex 

V2 in V2, which are nonadjacent, providing an induced P3 in G[S U T] on the vertices vi,V2 together 
with a vertex si G Si not adjacent to vi and a vertex S2 € <S'2 not adjacent to V2- 

We proceed with the proof of this claim. Conditioning on Ei for i = 0,1, 2, 3, fix a cut (^i, S2) in G[S] 
and let Vq, Vi, V2 be as above. Consider two possible cases. 

Case 1: |Vo| > 

In this case, the probability that T contains no vertex of Vq is at most 

(1 _ If < e-2/" < 2-^-'-\ 
at 

showing that even after our conditioning the probability of this event is smaller than 2~" ^ . 
Case 2: |Vo| < < f 

Let X = \U\ + \Vo\, y = \Si\ + |Vi|, and z = \S2\ + IV2I, so x + y + z = n. Assume without loss of 
generality that y < z. Since the partition V = {Si U Vi) U {S2 U V2 U f/ U Vq) is not a /3-cut, there are 
at least f3y{z + x) missing edges between these two sets. Since, in addition, Si is complete to ^2, Si is 
complete to V2, and Vi is complete to S2, then these missing edges go between Vi and V2 and between 
Si U Vi and UUVq. Thus 

^yn<Py{z + x) < \Vi\\V2\ - e{Vi,V2) + yx. 

If events Ei for i = 0,1,2,3 occur, then x < ju, and hence there are at least jyn missing edges 
between Vi and V2. In this case, every vertex of is complete to 6*2 U V2, and hence 

(1 — f3){n — l)>z = n — X — y>n — — y 

and 

.3/3 B 
y > — n — 1 > —n. 
^ - 4 - 2 

Thus, the number of missing pairs between Vi and V2 in the case events E^ for i = 0, 1, 2, 3 occur is at 
least jyn > ^n^. 

Let E4 be the event that T contains the two vertices of at least one of the nonedges between Vi and 
V2. Given that there are at least edges missing between Vi and V2, the probability that event E4 
occurs is at least the probability that at least one of t/2 random pairs of vertices of G contains one of 
the nonedges between Vi and V2. The probability that this does not happen is at most 



, i/2 

/3 n_/8 \ ^ _ „-/32io5 

(2) 



and hence even after our conditioning the probability of this event is smaller than 2 
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By the union bound it now follows that with positive probability Ei for i = 0, 1,2,3 occur and yet 
^[5* U r] contains an induced P3. This is a contradiction, completing the proof. □ 



Let /(e, n) be the least 6 for which there is a graph G = {V, E) on n vertices which is e-far from being 
induced Pa-free and has 5n^ induced copies of P3. 

Theorem 2.2 There is no > en such that f{e,n) > c(e,no)e^ > 

Proof: Let G = {V, E) be a graph on n vertices which is e-far from being induced Pa-free. Partition 
V into two parts along an e-cut, and continue refining parts along e-cuts of the subgraphs induced by 
the parts until no part has an e-cut, and let ^ = Vi U . . . U be the resulting partition. We modify 
edges along these e-cuts to turn them into cuts, letting G' be the resulting graph. The total fraction of 
pairs of vertices changed in making G' from G is at most e, so at least en? — £(2) ^ en? /2 edges must 
be changed from the resulting graph G' to make it induced Ps-free. We can modify edges in each Vi 
to make it induced Ps-free, and the resulting graph on V is induced Pa-free. If < en for 1 < i < /c, 
then the number of edge modifications made to G' to obtain an induced Pa-free graph is at most 



a contradiction. Thus, one of the parts Vi^ call it Vq, has no > en vertices, and (^[Vq] has no /3-cut. 
Therefore, the induced subgraph G[Vb], and hence G, has at least 



Consider the following one-sided e-tester for induced Pa-freeness. Let 5 = (e/100)^^. The algorithm 
samples t = 25"^ quadruples of vertices uniformly at random, and accepts if none of them form an 
induced Pa, and otherwise rejects. Any induced Pa-free graph is clearly accepted. If a graph is e- 
far from being induced Pa-free, then it contains at least (5?i^ induced Pa by Theorem 12.21 and the 
probability that none of the sampled quadruples forms an induced Pa is at most (1 — 5)* < 1/3. Note 
that the query complexity for this algorithm depends linearly on (5~^, and hence polynomially on e"^, 
completing the proof of Theorem 11.11 □ 

3 Testing perfectness 

We first observe a couple of equivalent versions of the triangle removal lemma. The triangle edge cover 
number ^{G) of a graph G is the minimum number of edges of G that cover all triangles in G, i.e., 
it is the minimum number of edges of G whose deletion makes G triangle-free. The triangle removal 
lemma thus says that for each e > there is 5 > such that every graph on n vertices with at most 
triangles satisfies v{G) < en^. 




c(e,no)n^ > c(e,no)e^n4 > (e/100)i2g4^4 > (g/ioo)i6„4 



induced copies of Pa, completing the proof. 



□ 
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The triangle packing number t{G) of a graph G is the maximum number of edge disjoint triangles in 
G. The following simple bounds hold for all graphs: 

t{G) < u{G) < 3r(G). 

Indeed, at least one edge from each of the edge-disjoint triangles is needed in any edge cover of the 
triangles in G, and deleting the 3r(G) edges from a maximum collection of edge-disjoint triangles 
leaves a triangle-free graph. We remark that a well known conjecture of Tuza states that the upper 
bound can be improved to z^(G') < 2r(G). Maxell |17j improved the upper bound factor to 3 — 

Thus, up to a constant factor change in e, the triangle removal lemma is the same as saying that a 
graph G on n vertices with at least en^ edge disjoint triangles contains at least 6n^ triangles. We 
can further suppose, up to a constant factor change in e, that G is tripartite. Indeed, every graph 
has a tripartite subgraph which contains at least 2/9 of the triangles in a maximum collection of 
edge-disjoint triangles. This can be seen by considering a uniform random tripartition. Each triangle 
has probability 2/9 of having one vertex in each part, so the expected number of the edge-disjoint 
triangles in the tripartition is 2/9 of the total, and there is a tripartition for which the number of 
edge-disjoint triangles is at least the expected number. We may thus assume G is tripartite. 

Theorem 3.1 Let T be a graph on n vertices which is lAe-far from being triangle-free such that a 
random sample of d vertices ofT is triangle-free with probability at least 1/2. Then there is a graph G 
on 5n vertices which is e/ 25 -far from being induced C^-free, such that a random sample of d vertices 
of G is a comparability graph with probability at least 1/2. 

Proof: By the remarks above, T contains a tripartite subgraph F which contains at least ^ • | • 14en^ > 
en^ = (e/25)(5n)^ edge-disjoint triangles. Denote the three parts of F by V2, V3, V5. 

Let G = {V-,E) be the graph on 5n vertices with partition F = Vi U V2 U V3 U V4 U V5, where Vi and V4 
are of size 2n each, and V2, V3, V5 are the parts of F. We next specify the edges between the various 
parts of G. Each part Vi, 1 < i < 5, is an independent set. There are no edges between Vi and ¥2-, 
between Vi and V3, between V3 and V4, and between V4 and V5. There is a complete bipartite graph 
between Vi and V4, between Vi and V5, and between V2 and V4. The edges of G are precisely the 
edges of F between V2 and V3, and between V3 and V5. Finally, between V2 and V5, the edges of G 
are precisely the nonedges of F. 

Arbitrarily order Ti, . . . ,Tt a maximum collection of t = t{F) > en? edge-disjoint triangles in F. As 
F is a tripartite graph on n vertices, t = t{F) is at most the product of the two smallest parts, which 
is at most n^/9. For every triangle in F, the same three vertices in G with a vertex in Vi and a 
vertex in V4 form an induced C5. We next show that this implies that there are t induced copies of 
C5 in G, labeled Li, . . . ,Lt, such that each pair intersects in at most one vertex. In fact, we greedily 
construct Li, . . . ,Lt so that they further satisfy that the vertex set of each Lj consists of the vertices 
of Ti together with a vertex in Vi and a vertex in V4. 

Suppose we have already constructed Lj for j < i satisfying the desired properties. We next show how 
to construct Lj with the desired properties. Note that in a tripartite graph, the number of edge-disjoint 
triangles containing a given vertex v is at most the minimum order of the two parts not containing v. 
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It follows that Tj has nonempty intersection with at most n of the t triangles Ti, . . . ,Tt. Hence, for 
/i = 1, 4, at most n vertices in Vh are in at least one Lj with j < i for which Tj and Tj share a vertex 
in common. For h = 1,4:, delete these vertices from Vh, and denote the resulting subset of V/j as V^, so 
\Vh\ > — n = n. Asi — l<t<n^< |V^/||V"4|, there is a pair (^1,^4) G V( x F^' that is not in any 
Lj with j < i. We pick Lj to be the induced C5 in G with vertices fi, and the vertices of Tj. It is 
clear from this construction that Lj intersects each Lj with j < z in at most one vertex. We therefore 
can greedily construct the desired t induced copies of C5, and conclude that G is e/25-far from being 
induced Cs-free. 

On the other hand, the only triples a < 6 < c of vertices in a linear ordering which puts the vertices in 
Vj before Vj\ii< j with a adjacent to b, b adjacent to c, and a not adjacent to c are with a G V2, b G V3, 
and c G V5 the vertices of a triangle in F. Thus, by sampling d vertices uniformly at random from G, 
we sample at most d vertices uniformly at random from F. These at most d vertices are triangle- free 
in F with probability at least 1/2, and hence the d random vertices in G form a comparability graph 
with probability at least 1/2. This completes the proof. □ 

As discussed toward the end of the introduction. Theorem 13.11 implies Theorem 1 1 . 2 1 that testing per- 
fectness is hard, and Theorem 11.31 that testing for comparability graphs is hard. 

A partially ordered set (poset) is a directed graph on a vertex set P which 

• has no loops, i.e., no pair {x,x) is an edge, 

• has no antiparallel edges, i.e., if {x,y) is an edge, then {y,x) is not an edge, 

• is transitive, i.e., if {x,y) is an edge and {y,z) is an edge, then {x,z) is also an edge. 

The fact that testing for posets is hard (at least as hard as testing for triangle-freeness) follows from 
Theorem 13.11 by adding directions. However, we next sketch a simpler proof. Let T be a tripartite 
graph on n vertices with parts Vi, V2, V3 which is e-far from being triangle- free. Consider the directed 
graph G on the same vertex set as T with {vi,V2) G Vi x V2 an edge of G if it is an edge of T, 
{1^2, V3) G V2 X V3 an edge of G if it is an edge of T, {vi,V3) G Vi x V3 an edge of G if it is not an 
edge of T, and there are no other edges. At least one pair in every triangle of T must be modified to 
turn G into a poset, so G is e-far from being a poset. Also, any subset of vertices which is triangle- 
free in T induces a poset in G. This implies that testing for posets is at least as hard as testing for 
triangle-freeness. 

4 Concluding Remarks 

We believe that comparing the number of queries needed to test various properties, as done in this 
paper comparing testing perfectness and triangle-freeness, could be an interesting direction for further 
research. This is the analogue in property testing to the powerful technique of hardness reductions in 
complexity theory. One general class of hard graph properties for testing for which to compare with 
is (not necessarily induced) i?-freeness for H a fixed odd cycle. 
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We showed that testing perfectness is hard. This is equivalent to showing that there is a graph which 
is e-far from being perfect such that a random set of vertices of size polynomial in is perfect with 
probability at least 1/2. This still leaves the possibility of getting a small witness if the graph is far 
from being perfect. That is, does every graph which is e-far from being perfect contain an induced 
odd cycle or its complement of size at least 5 and at most a polynomial in e~^? 

We showed that testing induced Pa-freeness is easy, which is a step toward completing the classification 
of graphs H for which induced H-hee testing is easy. It remains to determine whether or not induced 
C4-freeness is easy. 

Finally, it will be very interesting to characterize all easily testable graph properties. As all these 
properties have to be strongly testable, it follows from the main result of [5] that if we restrict ourselves 
only to natural properties, in the sense of [5], then these properties have to be essentially hereditary. 
Among the hereditary properties, properties that are known to be easily testable include the property 
of being /c-colorable for any fixed k, as shown in |14j . as well as a natural extension of it, as proved 
in [15j . As mentioned in the introduction, additional easily testable (hereditary) properties are H- 
freeness for any bipartite H, and induced ff-freeness for any path H on at most 4 vertices or its 
complement (where the case of 4 vertices is proved in Section 2). 

Hereditary properties which are not easily testable are ff-freeness for nonbipartite H, induced H- 
freeness for all graphs besides the paths on at most 4 vertices and their complements, as well as 
possibly the cycle of length 4 and its complement, perfectness and comparability. Our techniques 
here can be applied to provide several additional examples of easily testable and of non-easily testable 
hereditary properties, but most of these are somewhat artificial and not familiar graph properties. 
Does the above list of known results suggest a (conjectured) characterization of all easily testable 
hereditary graph properties? At the moment we are unable to formulate such a conjecture. 
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